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TECHNICAL FIELD 
The present invention pertains generally to signal analysis and synthesis filter banks 
such as those that may be used in audio and video coding systems. More particularly, the 
present invention pertains to analysis and synthesis filter banks implemented by a cascade of 
block transforms that are able to adapt the time and frequency resolutions of the filter banks. 

BACKGROUND ART 

Coding systems are often used to reduce the amount of information required to 
adequately represent a source signal. By reducing information capacity requirements, a signal 
representation can be transmitted over channels having lower bandwidth or stored on media 
using less space. Coding can reduce the information capacity requirements of a source signal 
by removing either redundant components or irrelevant components in the signal. So called 
perceptual coding methods and systems often use filter banks to reduce redundancy by 
decorrelating a source signal using a basis set of spectral components, and reduce irrelevancy 
by adaptive quantization of the spectral components according to psycho-perceptual criteria. 

Many perceptual coding systems implement the filter banks by block transforms. In 
an audio coding system, for example, a source audio signal, which is represented by time 
segments or blocks of time-domain samples, is transformed into sets of fi*equency-domain 
coefficients representing the spectral content of the source signal. The length of the segments 
establishes both the time resolution and the frequency resolution of the filter bank. Time 
resolution increases as the segment length decreases. Frequency resolution increases as the 
segment length increases. Because of this relationship, the choice of segment length imposes 
a trade off between the time and frequency resolution of a block transform filter bank. 

No single choice of segment length can provide an optimum trade off between 
resolutions for all of the source signal conditions that are encountered by typical coding 
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systems. Slowly varying or stationary source signals generally can be encoded more 
efficiently if the filter bank has a higher fi-equency resolution, which can be provided if a 
longer segment length is used. Rapidly varying or highly non-stationary source signals 
generally can be encoded more efficiently if the filter bank has a higher time resolution, 
5 which can be provided if a shorter segment length is used. By adapting the segment length in 
response to changing source signal conditions, a block transform filter bank can optimize the 
trade off between its time and frequency resolution. 

A large variety of transforms may be used to implement filter banks in audio coding 
systems, for example, but a particular Modified Discrete Cosine Transform (MDCT) is 

10 widely used because it has several very attractive properties for audio coding including the 
ability to provide critical sampling while allowing adjacent source signal segments to overlap 
one another. The MDCT is also attractive because it is able to remove substantially all 
redundant components in a source signal that is substantially stationary within a segment. 
Proper operation of the MDCT filter bank requires the use of overlapped source-signal 

15 segments and window fimctions that satisfy certain criteria described in Princen et al., 
"Subband/Transform Coding Using Filter Bank Designs Based on Time Domain Aliasing 
Cancellation," Proc. of the 1987 Intemational Conference on Acoustics, Speech and Signal 
Processing (ICASSP), May 1987, pp. 2161-64. Unfortunately, it is difficult to adapt the time 
and frequency resolution of MDCT filter banks in response to signal conditions because of 

20 the requirements imposed on the window functions that must be applied to overlapping 
source signal segments. 

One known technique sometimes referred to as "window switching" is able to adapt 
the time resolution of a MDCT filter bank by adaptively switching between two different 
window functions in response to the detection of certain signal conditions such as abrupt 

25 signal changes or amplitude transients. According to this technique, which is described in 
U.S. patent 5,214,742 by Edler, issued May 25, 1993, and incorporated herein by reference, 
segment lengths are not changed but the time resolution is adapted by switching between 
different window fiinction shapes to reduce the number of non-zero samples in each segment 
that are transformed by the filter bank. Unfortunately, this technique does not adapt the 

30 frequency resolution of the filter bank and the frequency selectivity of the filter bank is 

seriously degraded whenever the time resolution is reduced because the shape of the window 
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functions needed for window switching must be suboptimal to satisfy the requirements for 
proper operation of the MDCT. 

Another known technique sometimes referred to as "block switching" is similar to the 
window-switching technique mentioned above in that it also switches between different 
5 window function shapes, but the block-switching technique is able to adapt both time and 
frequency resolutions of a MDCT filter bank by also adaptively switching between two 
different segment lengths in response to the detection of certain signal conditions such as 
abrupt signal changes or amplitude transients. This technique is used in the Advanced Audio 
Coder (AAC), which is described in Bosi et al., "ISO/IEC MPEG-2 Advanced Audio 
10 Coding," J. Audio Eng. Soc, vol. 45, no. 10, October 1997, pp.789-814, and incorporated 
herein by reference. 

In AAC, a MDCT filter bank is applied to stationary source signal segments having a 
length equal to 2048 samples and is applied to non-stationary source signal segments having 
a length equal to 256 samples. Block switching is achieved in AAC by using "long window 

15 functions" that are appropriate for the longer segments, "short window functions" that are 
appropriate for the shorter segments, a "long-to-short bridging window function" that allows 
switching from the longer segment length to the shorter segment length, and a "short-to-long 
bridging window function" that allows switching from the shorter segment length to the 
longer segment length. The two bridging window functions allow switching between 

20 different segment length while satisfying the criteria necessary for proper operation of the 
MDCT. A switch from a longer segment length to a shorter segment length and back to the 
longer length is accomplished by applying the MDCT to a long segment using the long-to- 
short bridging window function, applyiiig the MDCT to an integer multiple of eight short 
segments using the short window function, and applying the MDCT to a long segment using 

25 the short-to-long bridging window function. Immediately thereafter, the MDCT must be 
applied to a long segment but the long window function may be used or the long-to-short 
bridging window function may be used if another block switch is desired. 

Although block switching does provide a way to adapt the time and frequency 
resolution of a MDCT filter bank, it is not an ideal solution for several reasons. One reason is 

30 that the frequency selectivity of the transform is degraded during a switch of block lengths 
because the shape of the bridge window functions must be suboptimal to allow segment- 
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length switching and to satisfy requirements for proper operation of the MDCT. Another 
reason is that a switch cannot occur at any arbitrary time. As explained above, the MDCT 
must be applied to another long segment inmiediately after switching to the longer segment 
length. An immediate switch to the shorter length is not possible. This block switching 
5 technique also is not an ideal solution because the switching mechanism provides only two 
segment lengths, which are not optimum for all signal conditions. For example, the two 
segment lengths in AAC are not optimal because neither the longer nor the shorter segment 
length in AAC is optimum for most speech signal segments. The 2048-sample segments are 
usually too long for the non-stationary nature of speech and the 256-sample segments are 

10 usually too short to remove redundant components effectively. Furthermore, there are many 
stationary signals for which a segment length longer than 2048 samples would be more 
optimum. As a result, the performance of AAC is impaired by the limited ability of block 
switching to adapt the time and frequency resolution of a MDCT filter bank. 

Another form of block switching is used in coding systems that conform to the Dolby 

15 Digital encoded bit stream standard. This coding standard, sometimes referred to as AC-3, is 
described in the Advanced Television Systems Committee (ATSC) A/52A document entitled 
"Revision A to Digital Audio Compression (AC-3) Standard" published August 20, 2001, 
and incorporated herein by reference. The form of block switching used in AC-3 coding 
systems applies a MDCT to source signal segments of either 512 samples for stationary 

20 signals or 256 samples for non-stationary signals. The block switching technique used in 
AC-3 coding systems provides more flexibility in choosing when length switches are made. 
Furthermore, coding performance is reasonably good for non-stationary source signals like 
speech; however, the coding performance for signals that are more stationary is limited by 
the relatively low frequency resolution provided by the longer segment. 

25 Other techniques for adaptive control of the time and frequency resolution of a 

MDCT filter bank are described in U.S. patent 5,394,473 by Davidson, which issued 
February 28, 1995 and are incorporated herein by reference. Some of these techniques allow 
a MDCT filter bank to be applied to segments of essentially any length using window 
fiinctions that provide much a better frequency response than is possible by other known 

30 techniques. Unfortunately, these techniques must adapt the kemel or basis ftinctions of the 
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MDCT and are, therefore, incompatible with existing bit stream standards like the AC-3 
standard mentioned above. These techniques are also computationally intensive. 

DISCLOSURE OF INVENTION 

5 What is needed is a more efficient and effective way to adapt the frequency resolution 

of filter banks implemented by transforms like the MDCT. Preferably, the solution should 
provide an implementation that simplifies its incorporation into systems that are compatible 
with existing bit stream standards. This is achieved by using a hybrid-transform filter bank 
that may be implemented by a cascade of block transforms. 

10 According to teachings of the present invention, analysis and synthesis filter banks 

are implemented by hybrid transforms that comprise a primary transform in cascade with one 
or more secondary transforms. In one implementation, the primary transform is a MDCT that 
is applied to source signal segments overlapping one another by one-half the segment length 
and the secondary transform is a DCT that is applied to non-overlapping blocks of MDCT 

15 coefficients for a particular frequency across time. The frequency resolution of the filter 
banks may be increased by increasing the number of coefficients in the blocks that are 
transformed by the one or more secondary transforms. The one or more secondary transforms 
may be applied to blocks of MDCT coefficients having a number of coefficients that varies 
with coefficient firequency, thereby allowing the frequency resolution of the filter bank to be 

20 adapted in a wide variety of ways. 

The various features of the present invention and its preferred embodiments may be 
better understood by referring to the following discussion and the accompanying drawings in 
which like reference numerals refer to like elements in the several figures. The contents of 
the following discussion and the drawings are set forth as examples only and should not be 

25 understood to represent limitations upon the scope of the present invention. 

BRIEF DESCRIPTION OF DRAWINGS 

Fig. 1 is a schematic block diagram of a transmitter used in a coding system. 
Fig. 2 is a schematic block diagram of a receiver used in a coding system. 
30 Fig. 3 is a schematic block diagram of a device that may be used to implement 

various aspects of the present invention. 
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Fig. 4 is a schematic block diagram of an analysis filter bank that incorporates various 
aspects of the present invention. 

Fig. 5 is a schematic block diagram of an analysis filter bank that incorporates various 
aspects of the present invention. 

5 

MODES FOR CARRYING OUT THE INVENTION 
A. Introduction 

The present invention provides a filter bank that is implemented by a hybrid 
transform whose fi-equency resolution can be easily adapted. Figs. 1 and 2 illustrate 
10 schematic block diagrams of a transmitter and receiver, respectively, in an audio coding 
system that may incorporate various aspects of the present invention. Features of the 
illustrated transmitter and receiver are discussed briefly in the following sections. Following 
this discussion, pertinent features of analysis and synthesis filter banks are discussed. 

1. Transmitter 

15 The transmitter illustrated in Fig. 1 applies the analysis filter bank 3 to a source signal 

received firom the path 1 to generate spectral coefficients that represent the spectral content of 
the source signal, applies the encoder 5 to the spectral coefficients to generate encoded 
information, and applies the formatter 8 to the encoded information to generate an output 
signal suitable for transmission along the path 9. The output signal may be delivered 

20 immediately to a receiver or recorded for subsequent delivery. The analysis filter bank 3 may 
be implemented in variety of ways as described below. 

In this disclosure, terms like "encoder" and "encoding" are not intended to imply any 
particular type of information processing. For example, encoding is often used to reduce 
information capacity requirements; however, these terms in this disclosure do not necessarily 

25 refer to this type of processing. The encoder 5 may perform essentially any type of 

processing that is desired. In one implementation, encoded information is generated by 
quantizing spectral coefficients according to a perceptual model using a wide variety of 
quantization techniques including vector quantization and gain-adaptive quantization 
described in U.S. patent 6,246,345 by Davidson et al., which issued June 12, 2001 and is 

30 incorporated by reference. No particular type of encoding is important to the present 
invention. 
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2. Receiver 

The receiver illustrated in Fig. 2 applies the deformatter 23 to an input signal received 
from the path 21 to obtain encoded information, applies the decoder 25 to the encoded 
information to obtain spectral coefficients representing the spectral content of a source 
5 signal, and applies the synthesis filter bank 27 to the spectral coefficients to generate an 
output signal along the path 29 that is a replica of the source signal but may not be an exact 
replica. The synthesis filter bank 27 may be implemented in a variety of ways that are 
complementary to the implementation of the analysis filter bank 3. 

In this disclosure, terms like "decoder" and "decoding" are not intended to imply any 
10 particular type of information processing. The decoder 25 may perform essentially any type 
of processing that is needed or desired. In one implementation that is inverse to an encoding 
process described above, quantized spectral components are decoded into dequantized 
spectral coefficients. No particular type of decoding is important to the present invention. 

B. Adaptive Hybrid Transform 
1 5 The analysis filter bank 3 and the synthesis filter bank 27 comprise hybrid transforms, 

which may be implemented as shown in Figs. 4 and 5, respectively. 

The analysis filter bank 3 shown in Fig. 4 comprises a primary transform 43 and one 
or more secondary transforms 45 in cascade with the primary transform. The primary 
transform is applied to segments of a source signal to generate sets of spectral coefficients 
20 that represent the spectral content of the source signal segments. Each of the one or more 
secondary transforms is applied to blocks of spectral coefficients for a particular frequency 
across time. The number of coefficients in each block is adapted in response to a control 
signal. 

The synthesis filter bank 27 shown in Fig. 5 comprises one or more inverse secondary 
25 transforms 52 and an inverse primary transform 54 in cascade with the inverse secondary 
transforms. Each of the one or more inverse secondary transforms generates blocks of 
spectral coefficients for a particular frequency across time. The number of coefficients in 
each block is adapted in response to a control signal. The blocks of spectral coefficients are 
assembled into sets of spectral coefficients for a particular time across frequency and the 
30 primary transform is applied to the sets of spectral coefficients to generate segments of a 
signal that are combined to provide a replica of an original source signal. 
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The primary transforms for the analysis fiUer bank and the synthesis filter bank 
implement an analysis/synthesis system in which an inverse primary transform cancels time- 
domain aUasing artifacts that are generated by the forward primary transform. For example, the 
Modified Discrete Cosine Transform (MDCT) and the Inverse MDCT (IMDCT) described in 
5 the Princen paper mentioned above implement the time-domain equivalent of an oddly- 
stacked critically sampled single-sideband analysis/synthesis system. These transforms are 
referred to herein as Oddly-Stacked Time-Domain Aliasing Cancellation (O-TDAC) 
transforms. Another TDAC implementation is described in Princen et al., "Analysis/Synthesis 
Filter Bank Design Based on Time Domain Aliasing Cancellation," IEEE Trans, on Acoust., 

10 Speech. Signal Proc. vol. ASSP-34, 1986, pp. 1 153-1 161. The analysis filter bank in this 

implementation comprises an application of a MDCT and a Modified Discrete Sine Transform 
(MDST) to altemate signal segments. The synthesis filter bank comprises an application of an 
IMDCT and an Inverse MDST (IMDST). These transforms implement the time-domain 
equivalent of an evenly-stacked critically sampled single-sideband analysis/synthesis system 

1 5 and are referred to as Evenly-Stacked Time-Domain Aliasing Cancellation transforais. 

The secondary transforms may be implemented by any of a number of transforms 
including the Discrete Cosine Transform (DCT), the Discrete Sine Transform (DST), and the 
Discrete Fourier Transform (DFT). 

In a preferred implementation of the analysis filter bank 3, a type- II DCT is used in 

20 cascade with the O-TDAC MDCT mentioned above. In a counterpart implementation of the 
synthesis filter bank 27, the O-TDAC IMDCT is used in cascade with a type-II Inverse DCT 
(IDCT). These implementations are discussed in more detail below. 

1. Analysis Filter Bank 
Referring to Fig. 4, a sequence of source signal samples are received fi-om the path 1 

25 and stored in the buffer 41 . The analyzer 47 is an optional component that determines the 
number of source signal samples in each segment, or segment length, to use for subsequent 
processing by applying some analytical process to the stored samples. Essentially any 
analytical process may be used as desired. For example, amplitude transients may be detected 
as described in the ATSC A/52A document cited above. Information representing chosen 

30 segment lengths is passed along the path 2 to the formatter 7 for inclusion in the output 
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signal. Fixed length segments are used in an alternative implementation that omits the 
analyzer 47 and the path 2. 

a) Analysis Window Function 

The window 42 forms a sequence of overlapping segments by weighting the source 
signal samples in each segment with an analysis window function. The length and shape of 
the analysis window function for each segment is adapted in response to the segment length 
information received from the path 2. A wide variety of window functions may be used but a 
Kaiser-Bessel-Derived (KBD) window function is generally preferred because it has 
excellent frequency selectivity properties. This window function is derived from a Kaiser- 
Bessel window function that may be expressed as: 



lo 

W{n)^ — 



V N/2 



{oTO<n<N (1) 



Io[^a] 

where a = Kaiser-Bessel alpha factor, 

n = window function sample number, 

N= window function length in number of samples, and 

An alpha value in the range from 4 to 7 works well for typical audio coding apphcations. 

The derivation convolves the Kaiser-Bessel window function W(n) with a rectangular 
window function having a length equal to the desired window function length minus an 
overlap interval v. See expression 2. This convolution may be simpHfied as shown in 
expression 3. 

WP{n)=^=2—^ for 0 < n < (2) 

±W{k) 

k=0 

N-vl 

WP{n)=-^^ for 0 < « < (3) 

±W{k) 

k=0 
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where v = number of samples within the segment overlap interval, 
N= desired length of the window function, 
W{n) = Kaiser-Bessel window function of length v+1, 
WP{n) = derived product- window of length N, and 

"l for 0<k<N-v 
0 otherwise 

The KBD analysis window function may be obtained by taking the square root of the 
derived product-window WP(n). This analysis window function is shown in expression 4. 



WA{n)= 



N/2-l 



T,W(n-k) 



k=0 



N/2 



{otO<n<N 



(4) 



k=0 



b) Primary Transform 
The primary transform 43 transforms each segment of windowed source signal 
samples into a set of spectral coefficients. Each coefficient in a set of coefficients represents 
the spectral content of a windowed segment for a particular frequency. The 0-TDAC MDCT 
is used in a preferred implementation. This transform may be expressed as: 



N-i r 

C{k,m) = ^x^ n m ' — y WA{n) ' cos 



n=0 



2k 



2) 



N 0 
n + — + - 
4 2) 



N 



for0^ifc<iV (5) 



where k = frequency coefficient number, 
n = input signal sample number, 
m = source signal segment number, 
N= source signal segment length, 
x(n) = value of source signal x at sample n, and 
C(k) = MDCT coefficient k. 
The 0-TDAC MDCT produces a set of spectral coefficients of the form 

N 



{C{k,m)}= 



C{k,m) forO<k- 



0 foxk = 



N 



(6) 
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The primary transform may be implemented directly according to expression 5 or it 
may be implemented by processes that are computationally more efficient such as those 
using the Fast Fourier Transform (FFT) described in U.S. patent 5,394,473. The analysis 
window function and the primary transform may be adapted in response to segment length 
5 using essentially any process that may be desired. A few techniques are disclosed in U.S. 
patent 5,214,742, U.S. patent 5,394,473, the ATSC A/52A document, and the ISO/MPEG 
AAC document cited above. 

Spectral coefficients representing the spectral content of the windowed source signal 
segments for each of one or more respective fi-equencies are passed along respective signal 

10 paths and stored in buffers. The transmitter shown in Fig. 4, for example, passes spectral 

coefficients for each of two respective frequencies along one of two signal paths for storage 
in the buffers 44a and 44b- Only two signal processing paths are shown in Fig. 4 for 
illustrative clarity. Implementations of the analysis filter bank 3 for use in typical systems 
could have hundreds of paths. 

1 5 c) Spectral Component Analysis 

Referring to the upper signal path shown in Fig. 4, spectral coefficients for a 
particular frequency in a sequence of segments are stored in the buffer 44a and assembled 
into blocks. The analyzer 48a determines the number of coefficients in each block, which is 
the block length, and passes this length along the path 49a. This length may be determined by 

20 analyzing the coefficients that are stored in the buffer 44a. No particular method of analysis 
is critical in principle to the present invention. A few analytical methods are described here. 

One basic method forms the longest possible blocks of spectral coefficients in which 
the coefficients in a respective block are sufficiently similar in magnitude. This may be 
determined in a variety of ways. One way calculates differences in magnitude between 

25 adjacent spectral coefficients and identifies the longest block of adjacent coefficients in 
which the average difference is less than some threshold. Another way uses spectral 
coefficients that are stored in buffers for multiple signal paths. This approach sums the 
magnitude differences for a band of spectral coefficients and identifies the longest block in 
which the average difference across the band is less than some threshold. The width of the 

30 band may be commensurate with the so called critical bandwidths of the human auditory 
system. 
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Another basic method relies on signal analysis that is performed by signal encoding 
processes performed elsewhere in a transmitter. A transmitter that is compatible with the bit 
stream standard described in the A/52A document cited above, for example, generates an 
encoded signal with spectral coefficients represented as scaled values that are associated with 
scale factors. The scale factors are analyzed to identify sequences of MDCT coefficient sets 
that can share a common set of scale factors. The analyzer 48a adapts the block length for its 
respective signal path to equal the number of coefficient sets that share exponents. 



The secondary transform 45a transforms each block of spectral coefficients into a set 
of hybrid-transforai coefficients. The length of the transform is adapted in response to the 
block length information received fi"om the path 49a. In a preferred implementation, a type II 
DCT is applied to blocks of spectral coefficients that do not overlap one another. This 
transform may be expressed as: 



The secondary transform may be implemented directly according to expression 7 or it 
may be implemented by known processes that are computationally more efficient such as 
those described in chapter 4 of Rao et al., "Discrete Cosine Transform," Academic Press, 
Inc., 1990. 



The formatter 46a is an optional component that may be used to assemble the hybrid- 
transform coefficients and block length information into data that the encoder 5 and the 
formatter 7 can process. This allows the analysis filter bank 3 in the transmitter shown in 
Fig. 1 to be implemented by a hybrid transform with minimal changes to the rest of the 
transmitter. 
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d) Secondary Transform 




(7) 



where X(kJ) = hybrid transform coefficient j for MDCT coefficient k, 
M= length of block of MDCT coefficients k, and 




e) Formatter and Other Signal Paths 



The buffer 44b, the analyzer 48b, the secondary transform 45b, and the formatter 46b 
perform processes in the lower signal path that are analogous to those discussed above for the 
respective components in the upper signal path. 

f) Encoding 

In typical systems, the encoder 5 generates encoded information that represents the 
hybrid-transform coefficients in some encoded form. If perceptual encoding processes are 
used, the hybrid-transform coefficients are encoded into a form that reduces perceptual 
irrelevancy. Perceptual encoding processes usually cause spectral information to be lost that 
cannot be recovered or recreated by the receiver. The possibility of this loss is represented 

below by the symbol 'X{k,j), which denotes a possible modification to the hybrid-transform 
coefficients generated by the hybrid transform. The use of such encoding processes are not 
critical to the present invention. 

2. Synthesis Filter Bank 

Referring to Fig. 5, the deformatters 51a and 51b obtain hybrid trmsform coefficients 
and block length information from data that is received from the paths 26a and 26b, 
respectively. The block length information is passed along the paths 59a and 59b, and the 
hybrid-transform coefficients are passed to the inverse secondary transforms 52a and 52b. 
Only two signal processing paths are shown in Fig. 5 for illustrative clarity. Implementations 
of the synthesis filter bank 27 for use in typical systems could have hundreds of paths. 

The deformatter 51b, the inverse secondary transform 52b, and the buffer 53b 
perform processes in the lower signal path that are analogous to those discussed below for 
the respective components in the upper signal path. 

a) Deformatter 

Referring to the upper signal path shown in Fig. 5, the deformatter 5 la is an optional 
component that may be used to disassemble sets of hybrid-transform coefficients and block 
length information from data that is received from the deformatter 23 and the decoder 25. 
This allows a hybrid-transform implementation of the synthesis filter bank 27 to be 
incorporated into an existing receiver as shown in Fig. 2 with minimal changes to the rest of 
the receiver. 
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b) Inverse Secondary Transform 



The inverse secondary transform 52a transforms a set of hybrid-transform coefficients 
into a block of spectral coefficients that represent the spectral content for a particular 
frequency of a sequence of source signal segments. The block of spectral coefficients are 
stored in the buffer 53a. The length of the transform is adapted in, response to the block 
length information received fi-om the path 59a. In a preferred implementation, a type II IDCT 
is applied to blocks of spectral coefficients that do not overlap one another. This transform 
may be expressed as: 



where C{k, m) = MDCT coefficient k obtained from encoded hybrid-transform coefficients. 

The inverse secondary transform may be implemented directly according to 
expression 8 or it may be implemented by known processes that are computationally more 
efficient. 



The buffers 53a and 53b store spectral coefficients and pass them to the inverse 
primary transform 54 in such a way that the inverse primary transform receives sets of 
spectral coefficients that represent the spectral content of respective source signal segments. 
Segments of signal samples are generated by applying an inverse transform to the sets of 
spectral coefficients and stored in the buffer 55. The length of the inverse primary transform 
is adapted in response to segment length information received from the path 22. The 
0-TDAC IMDCT is used in a preferred implementation. 

In typical applications, half of the MDCT coefficients are discarded in the transmitter. 
The discarded coefficients may be recovered by the receiver using the following expression: 




(8) 



c) Inverse Primary Transform 



C{k)=-C{N-k) foTN/2<k<N 
The 0-TDAC IMDCT may be expressed as: 



(9) 



COS 




foTO<n<N 



(10) 
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where x = recovered signal sample. 

The inverse primary transform may be implemented directly according to expression 
10 or it may be implemented by known processes that are computationally more efficient 
such as those using the FFT described in U.S. patent 5,394,473. 
5 d) Synthesis Window Function 

The window 56 generates an output signal along the path 29 by weighting the 
segments of signal samples that are stored in the buffer 55 with a synthesis window function 
and adding the weighted samples in overlapping segments to one another in the overlapping 
portions. The inverse primary transform, synthesis window function and the overlap-add 

10 process cancel at least a substantial portion of the time-domain aliasing artifacts that were 
generated by the forward transform. Cancellation may not be exact because of modifications 
to the transform coefficients that were caused by encoding processes and by finite arithmetic 
precision in the calculations of the primary and secondary transforms. The length and shape 
of the synthesis window function for each segment is adapted in response to the segment 

1 5 length information received from the path 22. A KBD window function that is equal to the 
analysis window function WA show above in expression 4 is used in a preferred 
implementation. 

The synthesis window function and the inverse primary transform may be adapted in 
response to the segment length information using processes such as those referred to above. 
20 C. Implementation 

The preceding disclosure sets forth only a few implementations. A variety of 
transforms and transform types may be used. Principles of the present invention may be 
applied and implemented in a wide variety of ways. 

Devices that incorporate various aspects of the present invention may be implemented 
25 in a variety of ways including software for execution by a computer or some other apparatus 
that includes more specialized components such as digital signal processor (DSP) circuitry 
coupled to components similar to those found in a general-purpose computer. Fig. 3 is a 
schematic block diagram of device 70 that may be used to implement aspects of the present 
invention. DSP 72 provides computing resources. RAM 73 is system random access memory 
30 (RAM) used by DSP 72 for signal processing. ROM 74 represents some form of persistent 

storage such as read only memory (ROM) for storing programs needed to operate device 70 and 
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to carry out various aspects of the present invention. I/O control 75 represents interface circuitry 
to receive and transmit signals by way of communication channels 76, 77. Analog-to-digital 
converters and digital-to-analog converters may be included in I/O control 75 as desired to 
receive and/or transmit analog signals. In the embodiment shown, all major system components 
5 connect to bus 7 1 , which may represent more than one physical bus; however, a bus architecture 
is not required to implement the present invention. 

In embodiments implemented in a general purpose computer system, additional 
components may be included for interfacing to devices such as a keyboard or mouse and a 
display, and for controlling a storage device having a storage medium such as magnetic tape or 

1 0 disk, or an optical medium. The storage medium may be used to record programs of instructions 
for operating systems, utilities and applications, and may include embodiments of programs that 
implement various aspects of the present invention. 

The functions required to practice various aspects of the present invention can be 
performed by components that are implemented in a wide variety of ways including discrete 

1 5 logic components, integrated circuits, one or more ASICs and/or program-controlled processors. 
The manner in which these components are implemented is not important to the present 
invention. 

Software implementations of the present invention may be conveyed by a variety of 
machine readable media such as baseband or modulated communication paths throughout the 
20 spectrum including from supersonic to ultraviolet frequencies, or storage media that convey 

information using essentially any recording technology including magnetic tape, cards or disk, 
optical cards or disc, and detectable markings on media like paper. 
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